How Decoder-Only Transformers (like GPT) Work